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Given a directed graph G, algorithms are discussed for finding (i) all paths 
through G with prescribed originating and terminating nodes, (ii) a subset 
of these paths containing all the edges, {Hi) a subset containing all the 
edge-edge transitions, and (iv) a subset containing the most likely paths. 

I. INTRODUCTION 

Informally, a directed graph consists of a set of vertices or nodes 
together with a set of directed edges joining the nodes. (All of the figures 
below show directed graphs; for a formal definition see page 10 of Ref. 1. 
There may be more than one edge with the same originating and termi- 
nating nodes, and the originating and terminating nodes of an edge 
may coincide.) 

Common examples of directed graphs are state diagrams of systems: 
the nodes represent states of the system and an edge directed from 
node N, to node N t means that it is possible for the system to go directly 
from state N { to state N j . 

The following questions concerning the paths through a directed 
graph arose in testing for possible errors sections of the stored program 
of a No. 1 ESS electronic switching system. 2 However, these questions 
and the algorithms for their solution seem of sufficient general interest 
to warrant stating them independently of their origin. 

Given a directed graph G, the questions are: (i) Find the set a of all 
paths through G with prescribed originating and terminating nodes. 
(A path is just what one would expect; a formal definition is given in 
Section II.) (ii) Find a small subset of a which contains every edge 
occurring in a. (Hi) Find a small subset of a which contains all the 
edge-edge transitions occurring in any path in a. (iv) If a probability 
measure is associated with the edges of G, find the most probable paths 
in a. 

These questions and algorithms for their solution are discussed in 
Sections III, V, VI, and VII, respectively. Section II is concerned with 
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the notation used to describe paths, and Section IV with an algorithm 
for partially solving a combinatorial problem encountered in Sections V 
and VI. 

II. NOTATION FOR PATHS 

Definition: A path from node Ni to N 2 in a directed graph is a sequence 
of (not necessarily distinct) edges e x , e 2 , ■ • • , e c with the property 
that there are nodes N t = n x , n 2 , • • • , n t+1 = N 2 such that e,- is 
directed from n { to n i+1 for i = 1, 2, • • • , L The length I of a path is the 
number of edges it contains. 

A path is specified by giving the ordered string e x e 2 ■ ■ • e t of its edges. 
(We are in fact describing paths by the notation used in automata theory 
to describe regular expressions, as given, for example, in Ref. 3 and 
chapter 5 of Ref. 4. However, the treatment given here is self-contained.) 

It is convenient to include in the definition a path of zero length 
(whose endpoints N x and N 2 must coincide). This path is specified by 
the empty string A (not to be confused with the empty set <j>) . 

A collection of paths is specified by the sum of the strings of the 
individual paths. 

If S is a string, S l denotes SS • • • S (i.e., S concatenated i times) 
and S* denotes A + S + S 2 + S 3 + •■• .For example, in Fig. 1 the 
collection of all paths from 

N 2 to N t is 4>, 

N t to Nj is A, 

iVi to N 2 is a, 

N 4 toN 4 is A + / + f + f + ■ • • - /* 

N 2 to N 3 is d + ce + cje + cfe + • • • = d + cf*e, 

N x to N 3 is ad + (ac + b)f*e. 

Parentheses are used in the natural way. The following rules are easily 
verified. Here S is any sum of strings. 

<t> + 8 = S, <i>S = Stj> = 4 

S* = A + S + S 2 + S 3 + • • • 

A* = A 
AS = S 
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Fig. 1 — An example. 

(A + S)* = A + SS* = A + S* = S* 

Si + S 2 S* 2 S X = S*5, ; S l + S 1 S*S a = S.S* 

(S l + S 2 )* = (5? + S%)*. 



III. FINDING ALL PATHS THROUGH A GRAPH 

Let G be a directed graph with n nodes labeled Ni , N 2 , • • • , N„ . 
Methods are given for finding all paths through G having prescribed 
starting node N„ and (not necessarily distinct) terminating node N v . 
We first describe the McNaughton-Yamada Algorithm, which requires 
on the order of n 3 steps. 

Definition: Let a*, denote the set of paths which start at N, , end at N t , 
and do not pass through any intermediate node N p with p > k, for 
k = 0, 1, ■ • • , n, and i, j = 1,2, ■ ■ ■ , n. 

The algorithm successively computes a", for all i and j, then a), for 
all * and j, ■ ■ • , then a" J 1 for all i and j. The final step is to compute 
al, , the set of all paths from N^ to iV„ with no restriction on intermediate 
nodes, which is the desired result. 

The inductive step proceeds as follows. Suppose a k ~) is known for 
all i, j, and we wish to obtain a {j . Referring to Fig. 2, we see that 
the fundamental recurrence equation is 



an = a, 



+ fc— i/ *-i\* it— i 
a. a {auk ) otki ■ 



(1) 



In words, this says that the paths from JV,- to iV, containing intermediate 
nodes as high as k are made up of those containing intermediate nodes 
only as high as k — 1, a'lj 1 , plus all possible paths containing N k as an 
intermediate node, a1t l (a^ l )*a^ 1 . When k is equal to either i or j, 
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.k-i 




Fig. 2 — The inductive step. 

(1) may be simplified. We now give the complete statement of the 
algorithm. 

THE MCNAUGHTON-YAMADA ALGORITHM 3 

1. The Initial Step 

Define a° y for all i, j — 1, • • ■ , n by: 

(1.1) tii^h 

f</» if there is no edge from N, to N, • , 
"" "" \e! + e 2 + • • • if edges labeled e, , e 2 , ■ ■ • join iV, to N s ; 

(1.2) if i = j, 

f A if there is no edge from N t to itself, 
"•'*' \a + e x + e 2 + • • • if edges labeled e t , e 2 , • • • join N { to itself. 

2. The Inductive Step (Refer to Fig. 2) 

For k = 1, 2, • • • , n — 1 compute a*, for all i, j = 1, 2, • • • , n from: 

(2.1) if k ^ i,k ?± jthen 

k A— 1 I It— 1/ fc-l\* *-l. 

oti, = an + a ik {a kk ra kj , 

(2.2) if i t± j and k = i, 

an = {a ti )*<*,,- ; 

(2.3) if i j* j and k = j, 

(2.4) if i = j = k, 
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Fig. 3 — An example. 



3. The Final Step 

Finally, use whichever of (2.1) to (2.4) is appropriate to calculate 
al _„ , the set of all paths from N„ to N y . 

Remark: In steps 2 and 3, after obtaining expressions of the form 
a n „ = • • ■ (/3)* • • • , it may be convenient to simplify (/3)* by means 
of the rules given at the end of Section II. 

An Example: We will use the McNaughton-Yamada algorithm to 
compute the set of all paths in Fig. 3 which start at N x and end at A7 4 , 
or, in other words, aj 4 . 



Step 1. 



«?, 



Step 2. Since there are no paths into N x , a) f = «",- for all i, j. 
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where we have used the rules <f> -\- S = S and </>£ = S<f> = 4>. This may- 
be further simplified using the rules at the end of Section II as follows. 
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Since there are no paths out of iV 4 , «* f = a*, for all i, j. We can 
therefore go directly to Step 3: 

4 3 2i 2/2 \* 2 

an = a™ = «i4 -r ai3k«3aj «34 

= ac*6 + ac*d(A + ec*d)*(/ + ec*6) 
= ac*b + ac*d(ec*d)*(j + ec*b), 
which, if required, can be expanded to give 
aU = ab 4- acb + ac 2 b + • • • 

+ adj 4- ade& -f- adecb + arfec b + • ■ • 
4- adedj 4- adedeb + adedecb + • • • 
4- adecdf 4- adecdeb -\- adecdecb + • • • 
4- acrf/ 4- acdeb 4- acdecfr 4- acdecb -\- ■ ■ ■ 
+ acrferf/ + acdedeb 4- acdedecb + ■ ■ ■ 

+ ••• • 

It may be verified that this includes all possible paths from N t to N t . 
Remarks: (?) When programmed in a computer language capable of 
handling strings, such as snobol4, 5 this algorithm involves the calcula- 
tion of n n X n matrices (requiring on the order of n 3 steps). Enough 
storage space is required to hold two n X n matrices (the current 
[a*,], i, j = 1, • • • , n, matrix and the previously calculated [a*7']» h j — 
1, • • • , n, matrix) each entry of which is a string of letters, parentheses, 
4-'s and *'s. (it) With very little extra work Step 3 can be modified to 
give the paths between several pairs of nodes. This is valuable for 
analyzing large graphs, as we now show. 
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Analysis of Large Graphs by Partitioning 

Since the time required for the McNaughton-Yamada algorithm 
grows as the cube of the number of states, large graphs cannot be 
handled directly. However, such graphs can usually be handled by 
partitioning them into smaller subgraphs, applying the algorithm 
to each subgraph separately, and then reapplying the algorithm to the 
network of subgraphs. The following simple example will illustrate 
the method. 

Figure 4 shows a graph G partitioned into two subgraphs 6', and G 2 
which are interconnected at nodes N 2 and N 3 . (Only edges between 
the subgraphs are shown.) Suppose we wish to find all paths from 
Ni to N t . If G, , G 2 each contain 20 nodes, a direct application of the 
McNaughton-Yamada algorithm would require on the order of 40 3 = 
64,000 steps. This number is considerably reduced by the following 
technique. 

Let fit ,((?,) denote the set of all paths starting at N, , ending at N t , 
and lying entirely in the subgraph G„ . 

We first apply the McNaughton-Yamada algorithm to (7, and G 2 to 
obtain fi it (Oi), i, j = 1, 2, and 0* /(<?»), i, j = 3, 4. That is, we first find 
all the paths between the interconnecting nodes that lie completely in 
one of the subgraphs. (This will take on the order of 2-20 3 = 16,000 
steps.) 

We now replace G by the condensed graph G of Fig. 5. G contains 
(i) nodes .V, , A' 4 corresponding to the terminal nodes iV, , iV 4 , (u) nodes 
A r 2 , A' ;! corresponding to the interconnecting nodes N 2 , N 3 , (Hi) edges 
a, b corresponding to the interconnecting edges a, b of G, and (iv) edges 
corresponding to all the paths (3</(Grj), i, j = 1, 2, and ^^(Gi), i, j = 
3, 4, in G. 

The McNaughton-Yamada algorithm is now used to obtain all paths 
from A', to A' 4 in G. (This takes on the order of 4 3 = 64 steps.) It is 
clear that these paths are exactly all the paths from A^ to A^ in the 
original graph 0. Partitioning into two equal subgraphs has thus 
reduced the number of steps by approximately a factor of four. (Parti- 




G, G 2 

Fig. 4 — A graph partitioned into subgraphs. 
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/» Z ,(G,) b /J4 3 (G 2 ) 

Fig. . r »— The condensed graph corresponding to Fig. 4. 

tioning into k equal subgraphs would reduce it by a factor of about fc 2 .) 
The general method of analyzing a large graph by partitioning 
should now be clear. 

If n is the largest number of nodes that can be directly handled by 
the McNaughton-Yamada algorithm, then it is desirable to partition G 
in such a way that no subgraph has more than n„ nodes, and that the 
total number of interconnecting nodes (which is the number of nodes 
in the condensed graph) is also less than n a . (Of course the sugbraphs 
may themselves be partitioned.) 

IV. THE COVERING PROBLEM 

In Sections V and VI we will encounter a basic combinatorial problem, 
the covering problem, which may be stated as follows. Suppose a set 
S = {s t , s 2 , ■ • ■ , s n \ of n elements is given, together with a family 
tf of subsets of 8, 

S = {X lt X 2 , ••• ,X m } } X. C S. 

The problem is to find a subfamily 3C Q JF, say 

3C = [X u ,X iti ■•• ,X it \, 

where I is as small as possible, such that every element of <S appearing 
in SF also appears in 3C, or formally, such that 

I 1 UJ 2 U-UI. = I <1 UI <I U--U X u . 

3C is called a covering set for J. 

The family $ may be represented by an m X n (0, 1) matrix 9TI = 
(m ,-,-), where 

m ti = 1 if Sj 1 1, , 
if s t i Xi . 

The tth row of 311, written I(X t ), is called the indicator vector of X, , 
since it indicates which elements of S belong to Z, . 



PATHS THROUGH A NETWORK 379 

The problem is to find a minimal set of rows which together contain 
a 1 in every nonzero column. Equivalently, if we relabel the matrix so 
that columns correspond to subsets and rows to elements, the problem 
is to find a minimal system of representatives for the subsets. This 
problem is known to be difficult (Ref. G, page 521). 

The direct attack is to look at the rows taken 1, 2, 3, • • • , m at a time, 
until a covering set is found; this finds a minimal covering set, but may 
take up to 2 m — 1 steps. Several methods 7-13 have been given which 
are faster than the direct attack, but are still impractical for large m. 
Roth's algorithm 14 finds a locally minimal cover which has a high 
probability of being the minimal cover, for quite large values of m 
(up to several hundred). 

However, for our purposes, the following extremely simple (and 
appropriately named) algorithm is adequate. It finds a covering set in 
at most |m 2 steps, but may not find a minimal cover. 

THE GREEDY ALGORITHM 

The algorithm proceeds inductively, starting with 3C = <f> and 
(greedily) adding to .TC, each time that particular X, which will contribute 
the greatest number of new elements. 

We keep track of the elements in JC at each step by means of the 
indicator vector 

/(3C) = J(UA') 

XtX 

and stop when this is equal to 

7(ff) =J(UI). 

1. The Initial Step 

Set OC = <f>, /(JC) - (0, 0, • • • , 0). 

2. The Inductive Step 

Search through all X, t 5 that are not in 3C and find an X k which 
maximizes the number of elements of S which are in A' /; but not in 3C, 
i.e., which maximizes weight (I(X k ). and. not. I(3C)). (The weight of 
a vector is the number of its nonzero components, (a, , • • • , a„). and. 
(6, , • • • , b„) = (a { and bi , • • • , a„ and b„), not. (a, , • • • , a„) = 
(not a, , • • • , not a„), and .or. is defined similarly.) Break ties in any 
way. 

Add X k to 3C, and calculate the new I (3d) = old J(3C). or. I(X k ). 
Repeat Step 2 until Z(3C) = 7(SF); then stop. 
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Remarks: 

(i) The greedy algorithm often finds a covering set which is close to 

minimal, although it is possible to construct examples when the minimal 

covering set contains two subsets while the greedy algorithm uses more 

than N subsets, for any preassigned value of N. Are such examples rare? 

The behavior of the algorithm for a random family $ seems to be 

unknown. 

(m) Since the algorithm involves simple calculations with binary 
vectors it may be easily programmed on a computer. 

Example 1: The set of all paths from iV, to iV 4 in Fig. 6 consists of 

(^ + a 2 )(bi + 6 a ) (ci + c 2 ) = a l b l c l + a,&ic 2 + a^d + a x b 2 c 2 

+ a 2 &iCi + a 2 b x c 2 + a 2 b 2 Ci + a 2 b 2 c 2 . 

Suppose it is desired to find a minimal subset of these paths which 
contains all the edges S = [ai , a 2 , b t , b 2 , c t , c 2 }. 3= consists of the 
following eight subsets of S, shown together with their indicator vectors. 

i X t I(X { ) 



1 


Ctx&iCi 


10 10 10 
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ai&iC 2 


10 10 1 


3 


a t b 2 Ci 


10 110 
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aj) 2 c 2 


10 10 1 
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a 2 b l c 1 


110 10 
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a 2 &iC 2 


110 1 
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a 2 b 2 Ci 


10 110 


8 


a 2 o 2 c 2 


10 10 1 



The greedy algorithm then proceeds as follows. 
Step 1. 3C - 4>, I(X) = 000000. 

Step 2. Weight (/(X,). and. 111111) = 3 for all i, so we pick X, 
(any X< will do) and add it to JC: 3C = {X,}, I(X) = 101010. 
Step 2 again. Weight (i"(X,). and. 010101) is maximized by i = 8. 



a, b, c, 




"■- b 2 ' 

Fig. 6 — An example. 
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Then X = [X, , X 8 \, I(X) = 101010. or. 010101 = 111111. The 
algorithm terminates having found 

3C = [dibiCi , a 2 b 2 c 2 \ 

which is a correct solution. 

Example 2: The greedy algorithm does not always find a minimal 
covering set, as the following example shows. 

S = {1,2,3,4,5,6} 

ff = |X, - (1, 2, 3}, X 2 = {4, 5, 6}, X 3 = [1, 3, 4, 6}. 

The greedy algorithm finds 3C = {Xi , X 2 , X 3 \, while the minimal 
3Cis [X, , X 2 \. 

V. FINDING A SMALL SET OF PATHS CONTAINING ALL EDGES 

As before, let G be a directed graph with nodes iV\ , N 2 , • ■ • , N„ . 
Let a MV denote the set of paths from JV„ to N, . 

Definition: A set /3 M „ of paths from AT„ to N v is said to be a spanning set 
if every edge occurring in the set a^ occurs in /3 M „ . 

Example: In Fig. 7, the set of all paths from N Y to iV 2 is 

ai2 = (a + b)(c + rf) = ac + ad + &c + bd, 

whereas an example of a spanning set is /3 12 = ac + brf. 

The problem we consider in this section is to find a small spanning 
set /3„„ . Finding a minimal spanning set appears difficult, and the only 
method we know is essentially an exhaustive search, as given in the 
next paragraph. The main algorithm of this section, algorithm B, gives 
a small spanning set /3„, with a reasonable amount of computation. 

Finding the Smallest Spanning Set 0„, by Exhaustive Search 

Tins may be accomplished by first applying the McNaughton-Yamada 
algorithm of Section III to produce a condensed list of all paths from 
iV M to A r „ . Then truncate each expression S* appearing in this list to 
A + S. (Since there is no need to go around a loop more than once in 




b d 

Fig. 7 — An example. 
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succession, we can throw away the remaining terms of S* = A + S + 
g 2 _j_ g 3 + • • . .) We now have a. finite spanning set /3„„ and can use an 
exhaustive search to get a minimal set. 

The difficulty with this method is that the number of terms obtained 
in the final list will be very large. To illustrate we apply the method to 
the four-node graph of Fig. 3. We found that the complete set of paths 
from Ni to N t is 

<x u = ac*b + ac*d(ec*d)*(f + ec*b). 

Truncating each S* to A + S, we obtain 

a(A + c)b + a(A + c)d(A +-e(A + c)d)(f + e(A + c)b), 

which, when parentheses are removed, becomes 

ab + acb + adj + adeb + adecb + adedf + adedeb + adedecb 

+ adecdf + adecdeb + adecdecb + actf/ + aafefc + acrfecb + acdedf 

+ acdedeb + acdedecb + acdecdj + acdecdeb + acdecdecb. 

Then by inspection, or from the greedy algorithm of Section IV, we 
find that a minimal spanning set is for example 

/3 14 = adj + adecb. 

An Approximate Solution to the Problem- Algorithm B 

We noticed in the above example that the difficulty was not in 
finding a minimal spanning set— indeed there are a large number of ways 
of choosing one — but rather in the very rapid increase in the number 
of terms to be handled. The algorithm to be described now keeps the 
lists involved small. 

The basic idea is to follow the McNaughton-Yamada algorithm, but 
to use the greedy algorithm twice at each step to reduce the complete path 
sets a*,- t° small covering sets /3j t - . 

Definition: Let /3- ; be a set of paths from N< to N t containing no 
internal node N„ with p > k and containing every edge appearing 
in a*,- . 

Then 0JJ, = j3„» is an example of a spanning set of paths from N M to N, , 
which is what we are seeking. 

The algorithm will form the •,• by induction on k. At each step we will 
keep a record of the edges in 0*, by means of its indicator vector /(j8*,-). 

The inductive step proceeds as follows. Suppose /3*7" is known for 
all i, j, and we wish to obtain /3-,- (see Fig. 8). We restrict ourselves here 
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N, 




Fig. S — The inductive step of algorithm B. 

to the case when i, j and k are distinct, the other cases being left to the 
detailed statement of the algorithm. 

Suppose k k ; 1 = 7\ + T 2 + • • • + T T , where each T a is a path 
from N k to N k . Then a possible choice for jS*,- is 



tij 1 + tf: l T u T is 



Tntii 



(2) 



where we have used just enough T lr 's to include all the edges in 7\ + 
T 2 + • • ■ + T, that were not already contained in 



tfr 1 + fa l &i l . 



A better choice for #, , however, is to obtain (2) and then find a small 
spanning subset of (2) by the greedy algorithm. 
We now give the algorithm. 



ALGORITHM B 

Each /3*', will have the form of a sum of strings of edges, without 
*'s or parentheses. 

1. The Initial Step 
Define $, for all i, j = 1, • • • , n by: 

(1.1) if i * j, 
\<f> if there is no edge from TV,- to A7,- , 
Id + r 2 + • • • if edges labeled c, , c 2 , • ■ ■ join A/,- to N t ; 

(1.2) if i = ./, 

A if there is no edge from N, to itself, 

2 • • • if edges hibeled c t , c 2 , • • • join N , to itself. 



tf,- = 



fit = 
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2. The Inductive Step (Refer to Fig. 8) 

For k = 1, 2, • ■ • , n — 1, compute fa for all i, j - 1, 2, • • • , n 

as follows: 

(2.1) If k ^ i, /c ^ j: 

(2.1.1) Let the terms of jBJk 1 be 

at 1 - 2\ + r> + • • ■ + r r . 

(2.1.2) Form the indicator vector 

/, = lOO.oR.JGtf^.aR. 7(^7'). (2.1.3) 

(This includes all the edges in the three sides of the triangle of Fig. 8.) 

(2.1.4) Using the greedy algorithm, find a small subset of the T.'s 
in (2.1.1) which contains all the edges in 

/(/SJtfc 1 ). AND. NOT. 7, , 

i.e., find a small subset of the terms T a which includes all the new edges 
they contain. Let this subset be T ai + T a , + • • • + T am . 

(2.1.5) Form the set 

/3-7 1 + tfrXX. • • • r.jsJr 1 • • • (2.1.6) 

(By construction, this now contains all the edges visible in Fig. 8.) 
(2.1.7) Apply the greedy algorithm to the set (2.1.6) to find a small 
spanning subset. This is p ti . 

(2.2) If i j* j and k = i, replace (2.1.3) by A + Kfi'tJ 1 ), and replace 

(2.1.6) by 

T ai T a , • ■ ■ TeJij 1 . 

(2.3) If i ^ j and k = j, replace (2.1.3) by 7, = W{j l )s and replace 
(2.1.6) by 

/3j7 1 r Ul r„, ■•■ T am . 

(2.4) Hi = j = k: 

Replace (2.1.3) by 7! = and replace steps (2.1.5) and (2.1.7) by 

A- T ai T at ■■■ T am . 

3. The Final Step 

Use whichever of (2.1) to (2.4) is appropriate to calculate j8J, , the 
desired result. 
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Example: We use algorithm B to obtain a small spanning set /3 M of 
paths from node 1 to node 4 in Fig. 3. 
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The last step will be shown in detail. 

(2.1.1) $ 3 - ecd = 7\ . 

(2.1.2) /, - /(/&). or. 7(/3; 3 ). or. I(/3f 4 ) = 111000. or. 101100. or. 
011011 = 111111. 

(2.1.4) not. 7, = 000000 so no 77s need be used. 

(2.1.5) Pi + #3034 = acb + acd(f + ec6) = ac6 + acrf/ + acdecb. 
(2.1.7) From the greedy algorithm, 3 l4 = /3j 4 = acdecb + acrf/, which 

is a minimal solution (although minimal solutions with shorter strings 
are possible, such as acdeb + adj). 

Remarks: (i) If a fast version of the greedy algorithm is available, the 
computation time for algorithm B should not be much more than for 
the McNaughton-Yamada algorithm, (ii) An edge forming a loop of 
length one may be deleted from any sum of strings in which it appears 
more than once. If there are many such edges the algorithm should be 
modified to make a list of such edges and periodically delete duplicates 
from the /3*, . The modified algorithm would then give the improved 
solution acdeb -f adj to the above example, (in) As in Section III, large 
networks may be handled by partitioning. 
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VI. FINDING A SMALL SET OF PATHS CONTAINING ALL EDGE-EDGE TRANSI- 
TIONS. 

With a„ y defined as before, in this section we consider the problem of 
finding a small subset %,„ of a„, with the property that every edge-edge 
transition appearing in any path from N M to N„ appears in y„ . 

For example, consider the graph of Fig. 6. Here the set of all paths 
from iV, to iV 4 is 

an = (a, + a 2 )(&i + &a)(Ci + c 2 ) 

= ai&jCi + O161C2 + aib 2 Ci + a t b 2 c 2 
+ a 2 b t Ci + a 2 6iC 2 + a 2 b 2 c l + a 2 b 2 c 2 , 
and an example of 7,4 is 

7i4 = OibiC] + aib 2 Ci + a 2 6,c 2 + a 2 b 2 c 2 . 

To check this we observe that a 14 contains eight distinct edge-edge 
transitions: 

afii , a,6 2 , a 2 6i , a 2 b 2 , bid , biC 2 , fr 2 c, , 6 2 c 2 , 

and all of these appear in 7,4 . 

Of course 714 is not unique, another example being 
aib t c 2 + a t b 2 c 2 + a 2 b 1 c 1 + a 2 b 2 Ci . 

The idea of the solution is to construct from G a new graph called the 
transition graph, G T , which will have an edge for every edge-edge 
transition in G, and then to apply algorithm B to G T . 

Suppose then that G is given and it is desired to find 7 M „ . First form 
the augmented graph G by adding to G a node N which is connected 
to JV M by an edge z, , and a node N n+l to which N, is connected by 
an edge z 2 (see Fig. 9). 

From G we construct the transition graph G T as follows. The nodes 
of G T are (i) a node denoted (N„), and (ii) nodes denoted (e a , JV.) , • • • , 
(e, r( , Ni) if edges e a , • ■ ■ , e iri enter iV,- in G, for i = 1, 2, • • • , n + 1. 





G G 

Fig. 9 — Construction of augmented graph G. 
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The edges of G T are (i) an edge from (N„) to (z t , N„), labeled (2,); 
and (m) for every edge-edge transition in G, 

« n> r Nj 

o — > — o — > — o , 

there is a corresponding edge in G T : 

(e,N,)0 > O (/,#,). 

In general, we see that nodes of G T have labels of the form (edge of G, 
node of G), and edges have labels of the form (edge-edge transition 
pair of G). 

By construction, apart from the edge (z t ) of G T , there is a one-to-one 
correspondence between edge-edge transitions in G and edges of G T . 

To find 7 M „ we apply algorithm B to G T . Each path through G T from 
N to N n + l will have the form 

(zi), (fcOi ( e u - Oi («i. » e -=)» ' ' ' I («*r«l) (3) 

and this corresponds uniquely to the path 

e„ ,«*,,••■, e ir (4) 

from A^„ to iV, in G. The process of obtaining (4) from (3) will be called 
contracting. 

We can now state the algorithm. 

Algorithm C for Obtaining y^, 

1. From obtain G and then the transition graph G T . 

2. Apply algorithm B to find a small set of spanning paths from iV„ to 
N n + l mG T . 

3. Contract each of these paths to give a set of paths in G. This is y u , . 

Example: Let G be the graph of Fig. 6. Then G and G T are shown 
in Figs. 10-11. 

Applying algorithm B, or in this case even by inspection, we see that 
a minimal spanning set for Fig. 11 is 

(21) (ziOi) (aibi) (bid) (c,g 2 ) 

+ (zi)(zia,)(a,&a)(&2Ci)(cig 2 ) 
+ (z i ){z i a 2 ){a 2 b l )(b i c 2 ){c 2 z 2 ) 
4- (z x )(z x a 2 ){a 2 b 2 ){b 2 c 2 )(c 2 z 2 ), 
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b, 




N o°— * s. /v /\ s: ■? 

a 2 b 2 c 2 

Fig. 10 — Augmented graph G corresponding to Fig. 6. 

which contracts to give the paths 

fli&iCi + a^Ci + a 2 b { c 2 + a 2 b 2 c 2 , 
the same solution as found before. 

VII. FINDING THE MOST PROBABLE PATHS 

A directed graph G is given with a conditional probability measure 
associated with the edges. More precisely G has nodes N, , • • ■ , N„ , 
and associated with each edge e, directed say from N { to AT, , is the 
conditional probability p r that e will be traversed next, given that the 
last node reached was JV, . 

We wish to find the most probable paths through the graph, starting 
at N„ and ending at N v . The probability of a path is the product of the 
probabilities associated with the edges in the path. 

In other words it is desired to find those paths P for which 

probability (P) = J J p,. 

all (tdgei 
etP 

is the maximum, or is close to the maximum. 
If we label each edge e of G with the "length" 

q e = -log V. 
instead of with p, , an equivalent problem is to find those paths P for 
which 

E i. 

nil ed K ea t P 

is the minimum, or is close to the minimum. In the new graph this 
corresponds to finding the shortest paths between N„ and N, . This 
problem has been extensively studied and many good algorithms for 
its solution are available. We refer the reader to the recent survey by 
S. E. Dreyfus. 15 References 16 and 17 are earlier surveys covering a wide 
range of similar problems. The paper by H. Frank 18 is also relevant. 

VIII. SUMMARY 

Four questions which arise in testing a stored program for possible 
errors are stated quite generally in terms of listing the paths through 
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(C 2 N<) 



(Z 2 N 5 ) 
Fig. 11 — Transition graph G T corresponding to Fig. 10. 

a directed graph. Question 1 may be answered for small graphs by the 
MeNaughton-Yamada algorithm, and for large graphs by partitioning 
(Section III). Question 2 involves a difficult combinatorial problem, 
the minimal covering problem, a partial solution of which is given by the 
appropriately named greedy algorithm of Section IV. With the aid of 
the greedy algorithm, algorithm B solves question 2 (Section V). 
Question 3 is solved by the same method as question 2 (Section VI). 
Question 4 is shown to be equivalent to the widely studied "shortest-path 
problem," and references are given to the appropriate literature (Sec- 
tion VII). 
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